<120> METHODS FOR IDENTIFYING A BIOSYNTHETIC PATHWAY GENE PRODUCT



<130> BERK-032 



<150> US2004/032407 
<151> 2004-09-29 

<150> 60/507,220 
<151> 2003-09-29 

<160> 37 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



<400> 1 

gatctgcagt aggaggaatt aaccatgcat taccgttctt aact 44 

<210> 2 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 2 

ttgatctgcc tcctatgaag tccatggtaa att 33 

<210> 3 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



<400> 3 

acttcatagg aggcagatca aatgtcagag ttgagagcct tc 42 

<210> 4
<211> 33
<212> DNA

<213> Artificial Sequence
<220>

<223> primer
<400> 4

gagtattacc tcctatttat caagataagt ttc 33



<210> 5 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 5 

gataaatagg aggtaatact catgaccgtt tacacagcat cc 42 

<210> 6
<211> 30
<212> DNA

<213> Artificial Sequence
<220>

<223> primer
<400> 6

tacctgcagt tattcctttg gtagaccagt 30

<210> 7
<211> 45
<212> DNA

<213> Artificial Sequence
<220>

<223> primer
<400> 7

gatgtcgact aggaggaata taaaatgaaa aattgtgtca tcgtc 45

<210> 8
<211> 34
<212> DNA

<213> Artificial Sequence
<220>

<223> primer
<400> 8

ttagctgtcc tccttaattc aaccgttcaa tcac 34

<210> 9
<211> 46
<212> DNA

<213> Artificial Sequence
<220>

<223> primer
<400> 9

gatgtcgaca ggaggacagc taaatgaaac tctcaactaa actttg 46

<210> 10 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 10 

agtgtaatcc tccttatttt ttaacatcgt aag 33

<210> 11 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 11 

ttaaaaaata aggaggatta cactatggtt ttaaccaata aaacag 46

<210> 12
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 12 

atcgtcgact taggatttaa tgcaggtgac ggacc 35

<210> 13 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 13 

atcccgggag gaggattact atatgcaaac ggaacacgtc 40

<210> 14 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 14 

atcccgggtt atttaagctg ggtaaatg 28

<210> 15 
<211> 43 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 
<400> 15 

agatccgcgg aggaggaatg agtaatggac tttccgcagc aac 43 

<210> 16 
<211> 81 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 16 

agtgagagct cttatttatt acgctggatg atgttgggct agcaggagga attcaccatg 60 
agttttgata ttgccaaata c 81 

<210> 17 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 17 

tctgagcaac gaacgaagca tatatttatg tcctccaggc cttgattttg 50 

<210> 18 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 18 

caaaatcaag gcctggagga cataaatata tgcttcgttc gttgctcaga 50 

<210> 19 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 19 

gcatccatgg tatcatcctc cgttgatgtg atg 33 

<210> 20 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 20

tgataccatg gactttccgc agcaactcg 29



<210> 21 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 21 

gtacatgcat ttatttatta cgctggatga tg 32 

<210> 22 
<211> 59 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 22 

gggtaccggg ccccccctcg cctctagagt cgactaggag gaattcacca tgagttttg 59 

<210> 23 
<211> 1671 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic amorphadiene synthase gene 
<400> 23 

gattaaggca tgcaccatgg ccctgaccga agagaaaccg atccgcccga tcgctaactt 60 
cccgccgtct atctggggtg accagttcct gatctacgaa aagcaggttg agcagggtgt 120 
tgaacagatc gtaaacgacc tgaagaaaga agttcgtcag ctgctgaaag aagctctgga 180 
catcccgatg aaacacgcta acctgctgaa actgatcgac gagatccagc gtctgggtat 240 
cccgtaccac ttcgaacgcg aaatcgacca cgcactgcag tgcatctacg aaacctacgg 300 
cgacaactgg aacggcgacc gttcttctct gtggtttcgt ctgatgcgta aacagggcta 360 
ctacgttacc tgtgacgttt ttaacaacta caaggacaag aacggtgctt tcaaacagtc 420
tctggctaac gacgttgaag gcctgctgga actgtacgaa gcgacctcca tgcgtgtacc 480
gggtgaaatc atcctggagg acgcgctggg tttcacccgt tctcgtctgt ccattatgac 540 
taaagacgct ttctctacta acccggctct gttcaccgaa atccagcgtg ctctgaaaca 600 
gccgctgtgg aaacgtctgc cgcgtatcga agcagcacag tacattccgt tttaccagca 660 
gcaggactct cacaacaaga ccctgctgaa actggctaag ctggaattca acctgctgca 720 
gtctctgcac aaagaagaac tgtctcacgt ttgtaagtgg tggaaggcat ttgacatcaa 780 
gaaaaacgcg ccgtgcctgc gtgaccgtat cgttgaatgt tacttctggg gtctgggttc 840 
tggttatgaa ccacagtact cccgtgcacg tgtgttcttc actaaagctg tagctgttat 900 
caccctgatc gatgacactt acgatgctta cggcacctac gaagaactga agatctttac 960 
tgaagctgta gaacgctggt ctatcacttg cctggacact ctgccggagt acatgaaacc 1020 
gatctacaaa ctgttcatgg atacctacac cgaaatggag gaattcctgg caaaagaagg 1080 
ccgtaccgac ctgttcaact gcggtaaaga gtttgttaaa gaattcgtac gtaacctgat 1140 
ggttgaagct aaatgggcta acgaaggcca tatcccgact accgaagaac atgacccggt 1200 
tgttatcatc accggcggtg caaacctgct gaccaccact tgctatctgg gtatgtccga 1260 
catctttacc aaggaatctg ttgaatgggc tgtttctgca ccgccgctgt tccgttactc 1320 
cggtattctg ggtcgtcgtc tgaacgacct gatgacccac aaagcagagc aggaacgtaa 1380 
acactcttcc tcctctctgg aatcctacat gaaggaatat aacgttaacg aggagtacgc 1440 
acagactctg atctataaag aagttgaaga cgtatggaaa gacatcaacc gtgaatacct 1500 
gactactaaa aacatcccgc gcccgctgct gatggcagta atctacctgt gccagttcct 1560 
ggaagtacag tacgctggta aagataactt cactcgcatg ggcgacgaat acaaacacct 1620 
gatcaaatcc ctgctggttt acccgatgtc catctgatcc cgggattaga t 1671 
<210> 24
<211> 1671 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic amorphadiene synthase gene 



<400> 24 

ctaattccgt acgtggtacc gggactggct tctctttggc taggcgggct agcgattgaa 60
gggcggcaga tagaccccac tggtcaagga ctagatgctt ttcgtccaac tcgtcccaca 120
acttgtctag catttgctgg acttctttct tcaagcagtc gacgactttc ttcgagacct 180
gtagggctac tttgtgcgat tggacgactt tgactagctg ctctaggtcg cagacccata 240
gggcatggtg aagcttgcgc tttagctggt gcgtgacgtc acgtagatgc tttggatgcc 300
gctgttgacc ttgccgctgg caagaagaga caccaaagca gactacgcat ttgtcccgat 360
gatgcaatgg acactgcaaa aattgttgat gttcctgttc ttgccacgaa agtttgtcag 420
agaccgattg ctgcaacttc cggacgacct tgacatgctt cgctggaggt acgcacatgg 480
cccactttag taggacctcc tgcgcgaccc aaagtgggca agagcagaca ggtaatactg 540
atttctgcga aagagatgat tgggccgaga caagtggctt taggtcgcac gagactttgt 600
cggcgacacc tttgcagacg gcgcatagct tcgtcgtgtc atgtaaggca aaatggtcgt 660
cgtcctgaga gtgttgttct gggacgactt tgaccgattc gaccttaagt tggacgacgt 720
cagagacgtg tttcttcttg acagagtgca aacattcacc accttccgta aactgtagtt 780
ctttttgcgc ggcacggacg cactggcata gcaacttaca atgaagaccc cagacccaag 840
accaatactt ggtgtcatga gggcacgtgc acacaagaag tgatttcgac atcgacaata 900
gtgggactag ctactgtgaa tgctacgaat gccgtggatg cttcttgact tctagaaatg 960
acttcgacat cttgcgacca gatagtgaac ggacctgtga gacggcctca tgtactttgg 1020
ctagatgttt gacaagtacc tatggatgtg gctttacctc cttaaggacc gttttcttcc 1080
ggcatggctg gacaagttga cgccatttct caaacaattt cttaagcatg cattggacta 1140
ccaacttcga tttacccgat tgcttccggt atagggctga tggcttcttg tactgggcca 1200
acaatagtag tggccgccac gtttggacga ctggtggtga acgatagacc catacaggct 1260
gtagaaatgg ttccttagac aacttacccg acaaagacgt ggcggcgaca aggcaatgag 1320
gccataagac ccagcagcag acttgctgga ctactgggtg tttcgtctcg tccttgcatt 1380
tgtgagaagg aggagagacc ttaggatgta cttccttata ttgcaattgc tcctcatgcg 1440
tgtctgagac tagatatttc ttcaacttct gcataccttt ctgtagttgg cacttatgga 1500
ctgatgattt ttgtagggcg cgggcgacga ctaccgtcat tagatggaca cggtcaagga 1560
ccttcatgtc atgcgaccat ttctattgaa gtgagcgtac ccgctgctta tgtttgtgga 1620
ctagtttagg gacgaccaaa tgggctacag gtagactagg gccctaatct 1671



<210> 25 
<211> 1185 
<212> DNA 

<213> Escherichia coli 



<400> 25 

atgaaaaatt gtgtcatcgt cagtgcggta cgtactgcta tcggtagttt taacggttca 60 
ctcgcttcca ccagcgccat cgacctgggg gcgacagtaa ttaaagccgc cattgaacgt 120 
gcaaaaatcg attcacaaca cgttgatgaa gtgattatgg gtaacgtgtt acaagccggg 180 
ctggggcaaa atccggcgcg tcaggcactg ttaaaaagcg ggctggcaga aacggtgtgc 240 
ggattcacgg tcaataaagt atgtggttcg ggtcttaaaa gtgtggcgct tgccgcccag 300 
gccattcagg caggtcaggc gcagagcatt gtggcggggg gtatggaaaa tatgagttta 360 
gccccctact tactcgatgc aaaagcacgc tctggttatc gtcttggaga cggacaggtt 420 
tatgacgtaa tcctgcgcga tggcctgatg tgcgccaccc atggttatca tatggggatt 480 
accgccgaaa acgtggctaa agagtacgga attacccgtg aaatgcagga tgaactggcg 540 
ctacattcac agcgtaaagc ggcagccgca attgagtccg gtgcttttac agccgaaatc 600 
gtcccggtaa atgttgtcac tcgaaagaaa accttcgtct tcagtcaaga cgaattcccg 660 
aaagcgaatt caacggctga agcgttaggt gcattgcgcc cggccttcga taaagcagga 720 
acagtcaccg ctgggaacgc gtctggtatt aacgacggtg ctgccgctct ggtgattatg 780 
gaagaatctg cggcgctggc agcaggcctt acccccctgg ctcgcattaa aagttatgcc 840 
agcggtggcg tgccccccgc attgatgggt atggggccag tacctgccac gcaaaaagcg 900 
ttacaactgg cggggctgca actggcggat attgatctca ttgaggctaa tgaagcattt 960 
gctgcacagt tccttgccgt tgggaaaaac ctgggctttg attctgagaa agtgaatgtc 1020

aacggcgggg ccatcgcgct cgggcatcct atcggtgcca gtggtgctcg tattctggtc 1080 

acactattac atgccatgca ggcacgcgat aaaacgctgg ggctggcaac actgtgcatt 1140 

ggcggcggtc agggaattgc gatggtgatt gaacggttga attaa 1185 

<210> 26 
<211> 1476 
<212> DNA 

<213> Escherichia coli 
<400> 26 

atgaaactct caactaaact ttgttggtgt ggtattaaag gaagacttag gccgcaaaag 60
caacaacaat tacacaatac aaacttgcaa atgactgaac taaaaaaaca aaagaccgct 120 
gaacaaaaaa ccagacctca aaatgtcggt attaaaggta tccaaattta catcccaact 180 
caatgtgtca accaatctga gctagagaaa tttgatggcg tttctcaagg taaatacaca 240
attggtctgg gccaaaccaa catgtctttt gtcaatgaca gagaagatat ctactcgatg 300 
tccctaactg ttttgtctaa gttgatcaag agttacaaca tcgacaccaa caaaattggt 360 
agattagaag tcggtactga aactctgatt gacaagtcca agtctgtcaa gtctgtcttg 420 
atgcaattgt ttggtgaaaa cactgacgtc gaaggtattg acacgcttaa tgcctgttac 480 
ggtggtacca acgcgttgtt caactctttg aactggattg aatctaacgc atgggatggt 540 
agagacgcca ttgtagtttg cggtgatatt gccatctacg ataagggtgc cgcaagacca 600 
accggtggtg ccggtactgt tgctatgtgg atcggtcctg atgctccaat tgtatttgac 660 
tctgtaagag cttcttacat ggaacacgcc tacgattttt acaagccaga tttcaccagc 720 
gaatatcctt acgtcgatgg tcatttttca ttaacttgtt acgtcaaggc tcttgatcaa 780 
gtttacaaga gttattccaa gaaggctatt tctaaagggt tggttagcga tcccgctggt 840 
tcggatgctt tgaacgtttt gaaatatttc gactacaacg ttttccatgt tccaacctgt 900 
aaattggtca caaaatcata cggtagatta ctatataacg atttcagagc caatcctcaa 960 
ttgttcccag aagttgacgc cgaattagct actcgcgatt atgacgaatc tttaaccgat 1020 
aagaacattg aaaaaacttt tgttaatgtt gctaagccat tccacaaaga gagagttgcc 1080 
caatctttga ttgttccaac aaacacaggt aacatgtaca ccgcatctgt ttatgccgcc 1140 
tttgcatctc tattaaacta tgttggatct gacgacttac aaggcaagcg tgttggttta 1200 
ttttcttacg gttccggttt agctgcatct ctatattctt gcaaaattgt tggtgacgtc 1260 
caacatatta tcaaggaatt agatattact aacaaattag ccaagagaat caccgaaact 1320 
ccaaaggatt acgaagctgc catcgaattg agagaaaatg cccatttgaa gaagaacttc 1380 
aaacctcaag gttccattga gcatttgcaa agtggtgttt actacttgac caacatcgat 1440 
gacaaattta gaagatctta cgatgttaaa aaataa 1476 

<210> 27 
<211> 1509 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Truncated HMG-CoA reductase nucleotide sequence 
<400> 27 

atggttttaa ccaataaaac agtcatttct ggatcgaaag tcaaaagttt atcatctgcg 60 

caatcgagct catcaggacc ttcatcatct agtgaggaag atgattcccg cgatattgaa 120 

agcttggata agaaaatacg tcctttagaa gaattagaag cattattaag tagtggaaat 180 

acaaaacaat tgaagaacaa agaggtcgct gccttggtta ttcacggtaa gttacctttg 240

tacgctttgg agaaaaaatt aggtgatact acgagagcgg ttgcggtacg taggaaggct 300 

ctttcaattt tggcagaagc tcctgtatta gcatctgatc gtttaccata taaaaattat 360 

gactacgacc gcgtatttgg cgcttgttgt gaaaatgtta taggttacat gcctttgccc 420 

gttggtgtta taggcccctt ggttatcgat ggtacatctt atcatatacc aatggcaact 480 

acagagggtt gtttggtagc ttctgccatg cgtggctgta aggcaatcaa tgctggcggt 540 

ggtgcaacaa ctgttttaac taaggatggt atgacaagag gcccagtagt ccgtttccca 600 

actttgaaaa gatctggtgc ctgtaagata tggttagact cagaagaggg acaaaacgca 660 

attaaaaaag cttttaactc tacatcaaga tttgcacgtc tgcaacatat tcaaacttgt 720 

ctagcaggag atttactctt catgagattt agaacaacta ctggtgacgc aatgggtatg 780 

aatatgattt ctaaaggtgt cgaatactca ttaaagcaaa tggtagaaga gtatggctgg 840 

gaagatatgg aggttgtctc cgtttctggt aactactgta ccgacaaaaa accagctgcc 900 
atcaactgga tcgaaggtcg tggtaagagt gtcgtcgcag aagctactat tcctggtgat 960
gttgtcagaa aagtgttaaa aagtgatgtt tccgcattgg ttgagttgaa cattgctaag 1020
aatttggttg gatctgcaat ggctgggtct gttggtggat ttaacgcaca tgcagctaat 1080
ttagtgacag ctgttttctt ggcattagga caagatcctg cacaaaatgt tgaaagttcc 1140
aactgtataa cattgatgaa agaagtggac ggtgatttga gaatttccgt atccatgcca 1200
tccatcgaag taggtaccat cggtggtggt actgttctag aaccacaagg tgccatgttg 1260
gacttattag gtgtaagagg cccgcatgct accgctcctg gtaccaacgc acgtcaatta 1320
gcaagaatag ttgcctgtgc cgtcttggca ggtgaattat ccttatgtgc tgccctagca 1380
gccggccatt tggttcaaag tcatatgacc cacaacagga aacctgctga accaacaaaa 1440
cctaacaatt tggacgccac tgatataaat cgtttgaaag atgggtccgt cacctgcatt 1500
aaatcctaa 1509



<210> 28 
<211> 1332 
<212> DNA 

<213> Saccharomyces cerevisiae 



<400> 28 

atgtcattac cgttcttaac ttctgcaccg ggaaaggtta ttatttttgg tgaacactct 60
gctgtgtaca acaagcctgc cgtcgctgct agtgtgtctg cgttgagaac ctacctgcta 120
ataagcgagt catctgcacc agatactatt gaattggact tcccggacat tagctttaat 180
cataagtggt ccatcaatga tttcaatgcc atcaccgagg atcaagtaaa ctcccaaaaa 240
ttggccaagg ctcaacaagc caccgatggc ttgtctcagg aactcgttag tcttttggat 300
ccgttgttag ctcaactatc cgaatccttc cactaccatg cagcgttttg tttcctgtat 360
atgtttgttt gcctatgccc ccatgccaag aatattaagt tttctttaaa gtctacttta 420
cccatcggtg ctgggttggg ctcaagcgcc tctatttctg tatcactggc cttagctatg 480
gcctacttgg gggggttaat aggatctaat gacttggaaa agctgtcaga aaacgataag 540
catatagtga atcaatgggc cttcataggt gaaaagtgta ttcacggtac cccttcagga 600
atagataacg ctgtggccac ttatggtaat gccctgctat ttgaaaaaga ctcacataat 660
ggaacaataa acacaaacaa ttttaagttc ttagatgatt tcccagccat tccaatgatc 720
ctaacctata ctagaattcc aaggtctaca aaagatcttg ttgctcgcgt tcgtgtgttg 780
gtcaccgaga aatttcctga agttatgaag ccaattctag atgccatggg tgaatgtgcc 840
ctacaaggct tagagatcat gactaagtta agtaaatgta aaggcaccga tgacgaggct 900
gtagaaacta ataatgaact gtatgaacaa ctattggaat tgataagaat aaatcatgga 960
ctgcttgtct caatcggtgt ttctcatcct ggattagaac ttattaaaaa tctgagcgat 1020
gatttgagaa ttggctccac aaaacttacc ggtgctggtg gcggcggttg ctctttgact 1080
ttgttacgaa gagacattac tcaagagcaa attgacagct tcaaaaagaa attgcaagat 1140
gattttagtt acgagacatt tgaaacagac ttgggtggga ctggctgctg tttgttaagc 1200
gcaaaaaatt tgaataaaga tcttaaaatc aaatccctag tattccaatt atttgaaaat 1260
aaaactacca caaagcaaca aattgacgat ctattattgc caggaaacac gaatttacca 1320
tggacttcat ag 1332



<210> 29
<211> 1356 
<212> DNA 

<213> Saccharomyces cerevisiae 



<400> 29 

atgtcagagt tgagagcctt cagtgcccca gggaaagcgt tactagctgg tggatattta 60
gttttagata caaaatatga agcatttgta gtcggattat cggcaagaat gcatgctgta 120
gcccatcctt acggttcatt gcaagggtct gataagtttg aagtgcgtgt gaaaagtaaa 180
caatttaaag atggggagtg gctgtaccat ataagtccta aaagtggctt cattcctgtt 240
tcgataggcg gatctaagaa ccctttcatt gaaaaagtta tcgctaacgt atttagctac 300
tttaaaccta acatggacga ctactgcaat agaaacttgt tcgttattga tattttctct 360
gatgatgcct accattctca ggaggatagc gttaccgaac atcgtggcaa cagaagattg 420
agttttcatt cgcacagaat tgaagaagtt cccaaaacag ggctgggctc ctcggcaggt 480
ttagtcacag ttttaactac agctttggcc tccttttttg tatcggacct ggaaaataat 540
gtagacaaat atagagaagt tattcataat ttagcacaag ttgctcattg tcaagctcag 600
ggtaaaattg gaagcgggtt tgatgtagcg gcggcagcat atggatctat cagatataga 660
agattcccac ccgcattaat ctctaatttg ccagatattg gaagtgctac ttacggcagt 720
aaactggcgc atttggttga tgaagaagac tggaatatta cgattaaaag taaccattta 780



ccttcgggat taactttatg gatgggcgat attaagaatg gttcagaaac agtaaaactg 840 

gtccagaagg taaaaaattg gtatgattcg catatgccag aaagcttgaa aatatataca 900 

gaactcgatc atgcaaattc tagatttatg gatggactat ctaaactaga tcgcttacac 960 

gagactcatg acgattacag cgatcagata tttgagtctc ttgagaggaa tgactgtacc 1020 

tgtcaaaagt atcctgaaat cacagaagtt agagatgcag ttgccacaat tagacgttcc 1080 

tttagaaaaa taactaaaga atctggtgcc gatatcgaac ctcccgtaca aactagctta 1140 

ttggatgatt gccagacctt aaaaggagtt cttacttgct taatacctgg tgctggtggt 1200 

tatgacgcca ttgcagtgat tactaagcaa gatgttgatc ttagggctca aaccgctaat 1260 

gacaaaagat tttctaaggt tcaatggctg gatgtaactc aggctgactg gggtgttagg 1320 

aaagaaaaag atccggaaac ttatcttgat aaatag 1356 

<210> 30 
<211> 1191 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 30 

atgaccgttt acacagcatc cgttaccgca cccgtcaaca tcgcaaccct taagtattgg 60 
gggaaaaggg acacgaagtt gaatctgccc accaattcgt ccatatcagt gactttatcg 120 
caagatgacc tcagaacgtt gacctctgcg gctactgcac ctgagtttga acgcgacact 180 
ttgtggttaa atggagaacc acacagcatc gacaatgaaa gaactcaaaa ttgtctgcgc 240 
gacctacgcc aattaagaaa ggaaatggaa tcgaaggacg cctcattgcc cacattatct 300 
caatggaaac tccacattgt ctccgaaaat aactttccta cagcagctgg tttagcttcc 360 
tccgctgctg gctttgctgc attggtctct gcaattgcta agttatacca attaccacag 420
tcaacttcag aaatatctag aatagcaaga aaggggtctg gttcagcttg tagatcgttg 480 
tttggcggat acgtggcctg ggaaatggga aaagctgaag atggtcatga ttccatggca 540 
gtacaaatcg cagacagctc tgactggcct cagatgaaag cttgtgtcct agttgtcagc 600 
gatattaaaa aggatgtgag ttccactcag ggtatgcaat tgaccgtggc aacctccgaa 660 
ctatttaaag aaagaattga acatgtcgta ccaaagagat ttgaagtcat gcgtaaagcc 720 
attgttgaaa aagatttcgc cacctttgca aaggaaacaa tgatggattc caactctttc 780 
catgccacat gtttggactc tttccctcca atattctaca tgaatgacac ttccaagcgt 840 
atcatcagtt ggtgccacac cattaatcag ttttacggag aaacaatcgt tgcatacacg 900 
tttgatgcag gtccaaatgc tgtgttgtac tacttagctg aaaatgagtc gaaactcttt 960 
gcatttatct ataaattgtt tggctctgtt cctggatggg acaagaaatt tactactgag 1020 
cagcttgagg ctttcaacca tcaatttgaa tcatctaact ttactgcacg tgaattggat 1080 
cttgagttgc aaaaggatgt tgccagagtg attttaactc aagtcggttc aggcccacaa 1140 
gaaacaaacg aatctttgat tgacgcaaag actggtctac caaaggaata a 1191 

<210> 31 
<211> 9253 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> "single operon" nucleotide sequence 
<400> 31 

gacgcttttt atcgcaactc tctactgttt ctccataccc gtttttttgg gctagcagga 60 

ggaattcacc atggtacccg ggaggaggat tactatatgc aaacggaaca cgtcatttta 120 

ttgaatgcac agggagttcc cacgggtacg ctggaaaagt atgccgcaca cacggcagac 180 

acccgcttac atctcgcgtt ctccagttgg ctgtttaatg ccaaaggaca attattagtt 240 

acccgccgcg cactgagcaa aaaagcatgg cctggcgtgt ggactaactc ggtttgtggg 300 

cacccacaac tgggagaaag caacgaagac gcagtgatcc gccgttgccg ttatgagctt 360 

ggcgtggaaa ttacgcctcc tgaatctatc tatcctgact ttcgctaccg cgccaccgat 420 

ccgagtggca ttgtggaaaa tgaagtgtgt ccggtatttg ccgcacgcac cactagtgcg 480 

ttacagatca atgatgatga agtgatggat tatcaatggt gtgatttagc agatgtatta 540 

cacggtattg atgccacgcc gtgggcgttc agtccgtgga tggtgatgca ggcgacaaat 600 

cgcgaagcca gaaaacgatt atctgcattt acccagctta aataacccgg ggatcctcta 660 

gagtcgacta ggaggaatat aaaatgaaaa attgtgtcat cgtcagtgcg gtacgtactg 720 

ctatcggtag ttttaacggt tcactcgctt ccaccagcgc catcgacctg ggggcgacag 780 

taattaaagc cgccattgaa cgtgcaaaaa tcgattcaca acacgttgat gaagtgatta 840 
tgggtaacgt gttacaagcc gggctggggc aaaatccggc gcgtcaggca ctgttaaaaa 900
gcgggctggc agaaacggtg tgcggattca cggtcaataa agtatgtggt tcgggtctta 960
aaagtgtggc gcttgccgcc caggccattc aggcaggtca ggcgcagagc attgtggcgg 1020
ggggtatgga aaatatgagt ttagccccct acttactcga tgcaaaagca cgctctggtt 1080
atcgtcttgg agacggacag gtttatgacg taatcctgcg cgatggcctg atgtgcgcca 1140
cccatggtta tcatatgggg attaccgccg aaaacgtggc taaagagtac ggaattaccc 1200
gtgaaatgca ggatgaactg gcgctacatt cacagcgtaa agcggcagcc gcaattgagt 1260
ccggtgcttt tacagccgaa atcgtcccgg taaatgttgt cactcgaaag aaaaccttcg 1320
tcttcagtca agacgaattc ccgaaagcga attcaacggc tgaagcgtta ggtgcattgc 1380
gcccggcctt cgataaagca ggaacagtca ccgctgggaa cgcgtctggt attaacgacg 1440
gtgctgccgc tctggtgatt atggaagaat ctgcggcgct ggcagcaggc cttacccccc 1500
tggctcgcat taaaagttat gccagcggtg gcgtgccccc cgcattgatg ggtatggggc 1560
cagtacctgc cacgcaaaaa gcgttacaac tggcggggct gcaactggcg gatattgatc 1620
tcattgaggc taatgaagca tttgctgcac agttccttgc cgttgggaaa aacctgggct 1680
ttgattctga gaaagtgaat gtcaacggcg gggccatcgc gctcgggcat cctatcggtg 1740
ccagtggtgc tcgtattctg gtcacactat tacatgccat gcaggcacgc gataaaacgc 1800
tggggctggc aacactgtgc attggcggcg gtcagggaat tgcgatggtg attgaacggt 1860
tgaattaagg aggacagcta aatgaaactc tcaactaaac tttgttggtg tggtattaaa 1920
ggaagactta ggccgcaaaa gcaacaacaa ttacacaata caaacttgca aatgactgaa 1980
ctaaaaaaac aaaagaccgc tgaacaaaaa accagacctc aaaatgtcgg tattaaaggt 2040
atccaaattt acatcccaac tcaatgtgtc aaccaatctg agctagagaa atttgatggc 2100
gtttctcaag gtaaatacac aattggtctg ggccaaacca acatgtcttt tgtcaatgac 2160
agagaagata tctactcgat gtccctaact gttttgtcta agttgatcaa gagttacaac 2220
atcgacacca acaaaattgg tagattagaa gtcggtactg aaactctgat tgacaagtcc 2280
aagtctgtca agtctgtctt gatgcaattg tttggtgaaa acactgacgt cgaaggtatt 2340
gacacgctta atgcctgtta cggtggtacc aacgcgttgt tcaactcttt gaactggatt 2400
gaatctaacg catgggatgg tagagacgcc attgtagttt gcggtgatat tgccatctac 2460
gataagggtg ccgcaagacc aaccggtggt gccggtactg ttgctatgtg gatcggtcct 2520
gatgctccaa ttgtatttga ctctgtaaga gcttcttaca tggaacacgc ctacgatttt 2580
tacaagccag atttcaccag cgaatatcct tacgtcgatg gtcatttttc attaacttgt 2640
tacgtcaagg ctcttgatca agtttacaag agttattcca agaaggctat ttctaaaggg 2700
ttggttagcg atcccgctgg ttcggatgct ttgaacgttt tgaaatattt cgactacaac 2760
gttttccatg ttccaacctg taaattggtc acaaaatcat acggtagatt actatataac 2820
gatttcagag ccaatcctca attgttccca gaagttgacg ccgaattagc tactcgcgat 2880
tatgacgaat ctttaaccga taagaacatt gaaaaaactt ttgttaatgt tgctaagcca 2940
ttccacaaag agagagttgc ccaatctttg attgttccaa caaacacagg taacatgtac 3000
accgcatctg tttatgccgc ctttgcatct ctattaaact atgttggatc tgacgactta 3060
caaggcaagc gtgttggttt attttcttac ggttccggtt tagctgcatc tctatattct 3120
tgcaaaattg ttggtgacgt ccaacatatt atcaaggaat tagatattac taacaaatta 3180
gccaagagaa tcaccgaaac tccaaaggat tacgaagctg ccatcgaatt gagagaaaat 3240
gcccatttga agaagaactt caaacctcaa ggttccattg agcatttgca aagtggtgtt 3300
tactacttga ccaacatcga tgacaaattt agaagatctt acgatgttaa aaaataagga 3360
ggattacact atggttttaa ccaataaaac agtcatttct ggatcgaaag tcaaaagttt 3420
atcatctgcg caatcgagct catcaggacc ttcatcatct agtgaggaag atgattcccg 3480
cgatattgaa agcttggata agaaaatacg tcctttagaa gaattagaag cattattaag 3540
tagtggaaat acaaaacaat tgaagaacaa agaggtcgct gccttggtta ttcacggtaa 3600
gttacctttg tacgctttgg agaaaaaatt aggtgatact acgagagcgg ttgcggtacg 3660
taggaaggct ctttcaattt tggcagaagc tcctgtatta gcatctgatc gtttaccata 3720
taaaaattat gactacgacc gcgtatttgg cgcttgttgt gaaaatgtta taggttacat 3780
gcctttgccc gttggtgtta taggcccctt ggttatcgat ggtacatctt atcatatacc 3840
aatggcaact acagagggtt gtttggtagc ttctgccatg cgtggctgta aggcaatcaa 3900
tgctggcggt ggtgcaacaa ctgttttaac taaggatggt atgacaagag gcccagtagt 3960
ccgtttccca actttgaaaa gatctggtgc ctgtaagata tggttagact cagaagaggg 4020
acaaaacgca attaaaaaag cttttaactc tacatcaaga tttgcacgtc tgcaacatat 4080
tcaaacttgt ctagcaggag atttactctt catgagattt agaacaacta ctggtgacgc 4140
aatgggtatg aatatgattt ctaaaggtgt cgaatactca ttaaagcaaa tggtagaaga 4200
gtatggctgg gaagatatgg aggttgtctc cgtttctggt aactactgta ccgacaaaaa 4260
accagctgcc atcaactgga tcgaaggtcg tggtaagagt gtcgtcgcag aagctactat 4320
tcctggtgat gttgtcagaa aagtgttaaa aagtgatgtt tccgcattgg ttgagttgaa 4380
cattgctaag aatttggttg gatctgcaat ggctgggtct gttggtggat ttaacgcaca 4440
tgcagctaat ttagtgacag ctgttttctt ggcattagga caagatcctg cacaaaatgt 4500
tgaaagttcc aactgtataa cattgatgaa agaagtggac ggtgatttga gaatttccgt 4560
atccatgcca tccatcgaag taggtaccat cggtggtggt actgttctag aaccacaagg 4620
tgccatgttg gacttattag gtgtaagagg cccgcatgct accgctcctg gtaccaacgc 4680
acgtcaatta gcaagaatag ttgcctgtgc cgtcttggca ggtgaattat ccttatgtgc 4740
tgccctagca gccggccatt tggttcaaag tcatatgacc cacaacagga aacctgctga 4800
accaacaaaa cctaacaatt tggacgccac tgatataaat cgtttgaaag atgggtccgt 4860
cacctgcatt aaatcctaag tcgacctgca gtaggaggaa ttaaccatgt cattaccgtt 4920
cttaacttct gcaccgggaa aggttattat ttttggtgaa cactctgctg tgtacaacaa 4980
gcctgccgtc gctgctagtg tgtctgcgtt gagaacctac ctgctaataa gcgagtcatc 5040 
tgcaccagat actattgaat tggacttccc ggacattagc tttaatcata agtggtccat 5100 
caatgatttc aatgccatca ccgaggatca agtaaactcc caaaaattgg ccaaggctca 5160 
acaagccacc gatggcttgt ctcaggaact cgttagtctt ttggatccgt tgttagctca 5220 
actatccgaa tccttccact accatgcagc gttttgtttc ctgtatatgt ttgtttgcct 5280 
atgcccccat gccaagaata ttaagttttc tttaaagtct actttaccca tcggtgctgg 5340 
gttgggctca agcgcctcta tttctgtatc actggcctta gctatggcct acttgggggg 5400 
gttaatagga tctaatgact tggaaaagct gtcagaaaac gataagcata tagtgaatca 5460
atgggccttc ataggtgaaa agtgtattca cggtacccct tcaggaatag ataacgctgt 5520 
ggccacttat ggtaatgccc tgctatttga aaaagactca cataatggaa caataaacac 5580
aaacaatttt aagttcttag atgatttccc agccattcca atgatcctaa cctatactag 5640 
aattccaagg tctacaaaag atcttgttgc tcgcgttcgt gtgttggtca ccgagaaatt 5700 
tcctgaagtt atgaagccaa ttctagatgc catgggtgaa tgtgccctac aaggcttaga 5760 
gatcatgact aagttaagta aatgtaaagg caccgatgac gaggctgtag aaactaataa 5820 
tgaactgtat gaacaactat tggaattgat aagaataaat catggactgc ttgtctcaat 5880 
cggtgtttct catcctggat tagaacttat taaaaatctg agcgatgatt tgagaattgg 5940 
ctccacaaaa cttaccggtg ctggtggcgg cggttgctct ttgactttgt tacgaagaga 6000 
cattactcaa gagcaaattg acagcttcaa aaagaaattg caagatgatt ttagttacga 6060 
gacatttgaa acagacttgg gtgggactgg ctgctgtttg ttaagcgcaa aaaatttgaa 6120 
taaagatctt aaaatcaaat ccctagtatt ccaattattt gaaaataaaa ctaccacaaa 6180 
gcaacaaatt gacgatctat tattgccagg aaacacgaat ttaccatgga cttcatagga 6240 
ggcagatcaa atgtcagagt tgagagcctt cagtgcccca gggaaagcgt tactagctgg 6300 
tggatattta gttttagata caaaatatga agcatttgta gtcggattat cggcaagaat 6360 
gcatgctgta gcccatcctt acggttcatt gcaagggtct gataagtttg aagtgcgtgt 6420 
gaaaagtaaa caatttaaag atggggagtg gctgtaccat ataagtccta aaagtggctt 6480 
cattcctgtt tcgataggcg gatctaagaa ccctttcatt gaaaaagtta tcgctaacgt 6540 
atttagctac tttaaaccta acatggacga ctactgcaat agaaacttgt tcgttattga 6600 
tattttctct gatgatgcct accattctca ggaggatagc gttaccgaac atcgtggcaa 6660 
cagaagattg agttttcatt cgcacagaat tgaagaagtt cccaaaacag ggctgggctc 6720 
ctcggcaggt ttagtcacag ttttaactac agctttggcc tccttttttg tatcggacct 6780 
ggaaaataat gtagacaaat atagagaagt tattcataat ttagcacaag ttgctcattg 6840 
tcaagctcag ggtaaaattg gaagcgggtt tgatgtagcg gcggcagcat atggatctat 6900 
cagatataga agattcccac ccgcattaat ctctaatttg ccagatattg gaagtgctac 6960 
ttacggcagt aaactggcgc atttggttga tgaagaagac tggaatatta cgattaaaag 7020 
taaccattta ccttcgggat taactttatg gatgggcgat attaagaatg gttcagaaac 7080 
agtaaaactg gtccagaagg taaaaaattg gtatgattcg catatgccag aaagcttgaa 7140 
aatatataca gaactcgatc atgcaaattc tagatttatg gatggactat ctaaactaga 7200 
tcgcttacac gagactcatg acgattacag cgatcagata tttgagtctc ttgagaggaa 7260 
tgactgtacc tgtcaaaagt atcctgaaat cacagaagtt agagatgcag ttgccacaat 7320 
tagacgttcc tttagaaaaa taactaaaga atctggtgcc gatatcgaac ctcccgtaca 7380
aactagctta ttggatgatt gccagacctt aaaaggagtt cttacttgct taatacctgg 7440 
tgctggtggt tatgacgcca ttgcagtgat tactaagcaa gatgttgatc ttagggctca 7500 
aaccgctaat gacaaaagat tttctaaggt tcaatggctg gatgtaactc aggctgactg 7560 
gggtgttagg aaagaaaaag atccggaaac ttatcttgat aaataggagg taatactcat 7620 
gaccgtttac acagcatccg ttaccgcacc cgtcaacatc gcaaccctta agtattgggg 7680 
gaaaagggac acgaagttga atctgcccac caattcgtcc atatcagtga ctttatcgca 7740 
agatgacctc agaacgttga cctctgcggc tactgcacct gagtttgaac gcgacacttt 7800 
gtggttaaat ggagaaccac acagcatcga caatgaaaga actcaaaatt gtctgcgcga 7860 
cctacgccaa ttaagaaagg aaatggaatc gaaggacgcc tcattgccca cattatctca 7920 
atggaaactc cacattgtct ccgaaaataa ctttcctaca gcagctggtt tagcttcctc 7980 
cgctgctggc tttgctgcat tggtctctgc aattgctaag ttataccaat taccacagtc 8040 
aacttcagaa atatctagaa tagcaagaaa ggggtctggt tcagcttgta gatcgttgtt 8100 
tggcggatac gtggcctggg aaatgggaaa agctgaagat ggtcatgatt ccatggcagt 8160 
acaaatcgca gacagctctg actggcctca gatgaaagct tgtgtcctag ttgtcagcga 8220
tattaaaaag gatgtgagtt ccactcaggg tatgcaattg accgtggcaa cctccgaact 8280 
atttaaagaa agaattgaac atgtcgtacc aaagagattt gaagtcatgc gtaaagccat 8340 
tgttgaaaaa gatttcgcca cctttgcaaa ggaaacaatg atggattcca actctttcca 8400 
tgccacatgt ttggactctt tccctccaat attctacatg aatgacactt ccaagcgtat 8460
catcagttgg tgccacacca ttaatcagtt ttacggagaa acaatcgttg catacacgtt 8520 
tgatgcaggt ccaaatgctg tgttgtacta cttagctgaa aatgagtcga aactctttgc 8580 
atttatctat aaattgtttg gctctgttcc tggatgggac aagaaattta ctactgagca 8640
gcttgaggct ttcaaccatc aatttgaatc atctaacttt actgcacgtg aattggatct 8700 
tgagttgcaa aaggatgttg ccagagtgat tttaactcaa gtcggttcag gcccacaaga 8760 
aacaaacgaa tctttgattg acgcaaagac tggtctacca aaggaataac tgcaggcatg 8820
caagcttggc tgttttggcg gatgagagaa gattttcagc ctgatacaga ttaaatcaga 8880 
acgcagaagc ggtctgataa aacagaattt gcctggcggc agtagcgcgg tggtcccacc 8940 
tgaccccatg ccgaactcag aagtgaaacg ccgtagcgcc gatggtagtg tggggtctcc 9000 
ccatgcgaga gtagggaact gccaggcatc aaataaaacg aaaggctcag tcgaaagact 9060
gggcctttcg ttttatctgt tgtttgtcgg tgaacgctct cctgagtagg acaaatccgc 9120 
cgggagcgga tttgaacgtt gcgaagcaac ggcccggagg gtggcgggca ggacgcccgc 9180 
cataaactgc caggcatcaa attaagcaga aggccatcct gacggatggc ctttttgcgt 9240 
ttctacaaac tct 9253



<210> 32 
<211> 4760 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> "MEVT" operon nucleotide sequence 
<400> 32 

gacgcttttt atcgcaactc tctactgttt ctccataccc gtttttttgg gctagcagga 60 

ggaattcacc atggtacccg gggatcctct agagtcgact aggaggaata taaaatgaaa 120 

aattgtgtca tcgtcagtgc ggtacgtact gctatcggta gttttaacgg ttcactcgct 180 

tccaccagcg ccatcgacct gggggcgaca gtaattaaag ccgccattga acgtgcaaaa 240 

atcgattcac aacacgttga tgaagtgatt atgggtaacg tgttacaagc cgggctgggg 300

caaaatccgg cgcgtcaggc actgttaaaa agcgggctgg cagaaacggt gtgcggattc 360 

acggtcaata aagtatgtgg ttcgggtctt aaaagtgtgg cgcttgccgc ccaggccatt 420 

caggcaggtc aggcgcagag cattgtggcg gggggtatgg aaaatatgag tttagccccc 480 

tacttactcg atgcaaaagc acgctctggt tatcgtcttg gagacggaca ggtttatgac 540 

gtaatcctgc gcgatggcct gatgtgcgcc acccatggtt atcatatggg gattaccgcc 600 

gaaaacgtgg ctaaagagta cggaattacc cgtgaaatgc aggatgaact ggcgctacat 660 

tcacagcgta aagcggcagc cgcaattgag tccggtgctt ttacagccga aatcgtcccg 720

gtaaatgttg tcactcgaaa gaaaaccttc gtcttcagtc aagacgaatt cccgaaagcg 780 

aattcaacgg ctgaagcgtt aggtgcattg cgcccggcct tcgataaagc aggaacagtc 840 

accgctggga acgcgtctgg tattaacgac ggtgctgccg ctctggtgat tatggaagaa 900 

tctgcggcgc tggcagcagg ccttaccccc ctggctcgca ttaaaagtta tgccagcggt 960 

ggcgtgcccc ccgcattgat gggtatgggg ccagtacctg ccacgcaaaa agcgttacaa 1020 

ctggcggggc tgcaactggc ggatattgat ctcattgagg ctaatgaagc atttgctgca 1080 

cagttccttg ccgttgggaa aaacctgggc tttgattctg agaaagtgaa tgtcaacggc 1140 

ggggccatcg cgctcgggca tcctatcggt gccagtggtg ctcgtattct ggtcacacta 1200 

ttacatgcca tgcaggcacg cgataaaacg ctggggctgg caacactgtg cattggcggc 1260 

ggtcagggaa ttgcgatggt gattgaacgg ttgaattaag gaggacagct aaatgaaact 1320 

ctcaactaaa ctttgttggt gtggtattaa aggaagactt aggccgcaaa agcaacaaca 1380 

attacacaat acaaacttgc aaatgactga actaaaaaaa caaaagaccg ctgaacaaaa 1440

aaccagacct caaaatgtcg gtattaaagg tatccaaatt tacatcccaa ctcaatgtgt 1500 

caaccaatct gagctagaga aatttgatgg cgtttctcaa ggtaaataca caattggtct 1560 

gggccaaacc aacatgtctt ttgtcaatga cagagaagat atctactcga tgtccctaac 1620 

tgttttgtct aagttgatca agagttacaa catcgacacc aacaaaattg gtagattaga 1680 

agtcggtact gaaactctga ttgacaagtc caagtctgtc aagtctgtct tgatgcaatt 1740

gtttggtgaa aacactgacg tcgaaggtat tgacacgctt aatgcctgtt acggtggtac 1800 
caacgcgttg ttcaactctt tgaactggat tgaatctaac gcatgggatg gtagagacgc 1860
cattgtagtt tgcggtgata ttgccatcta cgataagggt gccgcaagac caaccggtgg 1920
tgccggtact gttgctatgt ggatcggtcc tgatgctcca attgtatttg actctgtaag 1980
agcttcttac atggaacacg cctacgattt ttacaagcca gatttcacca gcgaatatcc 2040
ttacgtcgat ggtcattttt cattaacttg ttacgtcaag gctcttgatc aagtttacaa 2100
gagttattcc aagaaggcta tttctaaagg gttggttagc gatcccgctg gttcggatgc 2160
tttgaacgtt ttgaaatatt tcgactacaa cgttttccat gttccaacct gtaaattggt 2220
cacaaaatca tacggtagat tactatataa cgatttcaga gccaatcctc aattgttccc 2280
agaagttgac gccgaattag ctactcgcga ttatgacgaa tctttaaccg ataagaacat 2340
tgaaaaaact tttgttaatg ttgctaagcc attccacaaa gagagagttg cccaatcttt 2400
gattgttcca acaaacacag gtaacatgta caccgcatct gtttatgccg cctttgcatc 2460
tctattaaac tatgttggat ctgacgactt acaaggcaag cgtgttggtt tattttctta 2520
cggttccggt ttagctgcat ctctatattc ttgcaaaatt gttggtgacg tccaacatat 2580
tatcaaggaa ttagatatta ctaacaaatt agccaagaga atcaccgaaa ctccaaagga 2640
ttacgaagct gccatcgaat tgagagaaaa tgcccatttg aagaagaact tcaaacctca 2700
aggttccatt gagcatttgc aaagtggtgt ttactacttg accaacatcg atgacaaatt 2760
tagaagatct tacgatgtta aaaaataagg aggattacac tatggtttta accaataaaa 2820
cagtcatttc tggatcgaaa gtcaaaagtt tatcatctgc gcaatcgagc tcatcaggac 2880
cttcatcatc tagtgaggaa gatgattccc gcgatattga aagcttggat aagaaaatac 2940
gtcctttaga agaattagaa gcattattaa gtagtggaaa tacaaaacaa ttgaagaaca 3000
aagaggtcgc tgccttggtt attcacggta agttaccttt gtacgctttg gagaaaaaat 3060
taggtgatac tacgagagcg gttgcggtac gtaggaaggc tctttcaatt ttggcagaag 3120
ctcctgtatt agcatctgat cgtttaccat ataaaaatta tgactacgac cgcgtatttg 3180
gcgcttgttg tgaaaatgtt ataggttaca tgcctttgcc cgttggtgtt ataggcccct 3240
tggttatcga tggtacatct tatcatatac caatggcaac tacagagggt tgtttggtag 3300
cttctgccat gcgtggctgt aaggcaatca atgctggcgg tggtgcaaca actgttttaa 3360
ctaaggatgg tatgacaaga ggcccagtag tccgtttccc aactttgaaa agatctggtg 3420
cctgtaagat atggttagac tcagaagagg gacaaaacgc aattaaaaaa gcttttaact 3480
ctacatcaag atttgcacgt ctgcaacata ttcaaacttg tctagcagga gatttactct 3540
tcatgagatt tagaacaact actggtgacg caatgggtat gaatatgatt tctaaaggtg 3600
tcgaatactc attaaagcaa atggtagaag agtatggctg ggaagatatg gaggttgtct 3660
ccgtttctgg taactactgt accgacaaaa aaccagctgc catcaactgg atcgaaggtc 3720
gtggtaagag tgtcgtcgca gaagctacta ttcctggtga tgttgtcaga aaagtgttaa 3780
aaagtgatgt ttccgcattg gttgagttga acattgctaa gaatttggtt ggatctgcaa 3840
tggctgggtc tgttggtgga tttaacgcac atgcagctaa tttagtgaca gctgttttct 3900
tggcattagg acaagatcct gcacaaaatg ttgaaagttc caactgtata acattgatga 3960
aagaagtgga cggtgatttg agaatttccg tatccatgcc atccatcgaa gtaggtacca 4020
tcggtggtgg tactgttcta gaaccacaag gtgccatgtt ggacttatta ggtgtaagag 4080
gcccgcatgc taccgctcct ggtaccaacg cacgtcaatt agcaagaata gttgcctgtg 4140
ccgtcttggc aggtgaatta tccttatgtg ctgccctagc agccggccat ttggttcaaa 4200
gtcatatgac ccacaacagg aaacctgctg aaccaacaaa acctaacaat ttggacgcca 4260
ctgatataaa tcgtttgaaa gatgggtccg tcacctgcat taaatcctaa gtcgacctgc 4320
aggcatgcaa gcttggctgt tttggcggat gagagaagat tttcagcctg atacagatta 4380
aatcagaacg cagaagcggt ctgataaaac agaatttgcc tggcggcagt agcgcggtgg 4440
tcccacctga ccccatgccg aactcagaag tgaaacgccg tagcgccgat ggtagtgtgg 4500
ggtctcccca tgcgagagta gggaactgcc aggcatcaaa taaaacgaaa ggctcagtcg 4560
aaagactggg cctttcgttt tatctgttgt ttgtcggtga acgctctcct gagtaggaca 4620
aatccgccgg gagcggattt gaacgttgcg aagcaacggc ccggagggtg gcgggcagga 4680
cgcccgccat aaactgccag gcatcaaatt aagcagaagg ccatcctgac ggatggcctt 4740
tttgcgtttc tacaaactct 4760



<210> 33 
<211> 4482 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> "MEVB" operon nucleotide sequence
<400> 33 

gcgcaacgca attaatgtga gttagctcac tcattaggca ccccaggctt tacactttat 60
gcttccggct cgtatgttgt gtggaattgt gagcggataa caatttcaca caggaaacag 120
ctatgaccat gattacgcca agcgcgcaat taaccctcac taaagggaac aaaagctggg 180 
taccgggccc cccctcgagg tcgacggtat cgataagctt gatatcgaat tcctgcagta 240 
ggaggaatta accatgtcat taccgttctt aacttctgca ccgggaaagg ttattatttt 300 
tggtgaacac tctgctgtgt acaacaagcc tgccgtcgct gctagtgtgt ctgcgttgag 360
aacctacctg ctaataagcg agtcatctgc accagatact attgaattgg acttcccgga 420 
cattagcttt aatcataagt ggtccatcaa tgatttcaat gccatcaccg aggatcaagt 480 
aaactcccaa aaattggcca aggctcaaca agccaccgat ggcttgtctc aggaactcgt 540 
tagtcttttg gatccgttgt tagctcaact atccgaatcc ttccactacc atgcagcgtt 600 
ttgtttcctg tatatgtttg tttgcctatg cccccatgcc aagaatatta agttttcttt 660 
aaagtctact ttacccatcg gtgctgggtt gggctcaagc gcctctattt ctgtatcact 720 
ggccttagct atggcctact tgggggggtt aataggatct aatgacttgg aaaagctgtc 780 
agaaaacgat aagcatatag tgaatcaatg ggccttcata ggtgaaaagt gtattcacgg 840
taccccttca ggaatagata acgctgtggc cacttatggt aatgccctgc tatttgaaaa 900
agactcacat aatggaacaa taaacacaaa caattttaag ttcttagatg atttcccagc 960 
cattccaatg atcctaacct atactagaat tccaaggtct acaaaagatc ttgttgctcg 1020 
cgttcgtgtg ttggtcaccg agaaatttcc tgaagttatg aagccaattc tagatgccat 1080 
gggtgaatgt gccctacaag gcttagagat catgactaag ttaagtaaat gtaaaggcac 1140 
cgatgacgag gctgtagaaa ctaataatga actgtatgaa caactattgg aattgataag 1200 
aataaatcat ggactgcttg tctcaatcgg tgtttctcat cctggattag aacttattaa 1260 
aaatctgagc gatgatttga gaattggctc cacaaaactt accggtgctg gtggcggcgg 1320 
ttgctctttg actttgttac gaagagacat tactcaagag caaattgaca gcttcaaaaa 1380 
gaaattgcaa gatgatttta gttacgagac atttgaaaca gacttgggtg ggactggctg 1440 
ctgtttgtta agcgcaaaaa atttgaataa agatcttaaa atcaaatccc tagtattcca 1500 
attatttgaa aataaaacta ccacaaagca acaaattgac gatctattat tgccaggaaa 1560 
cacgaattta ccatggactt cataggaggc agatcaaatg tcagagttga gagccttcag 1620
tgccccaggg aaagcgttac tagctggtgg atatttagtt ttagatacaa aatatgaagc 1680 
atttgtagtc ggattatcgg caagaatgca tgctgtagcc catccttacg gttcattgca 1740 
agggtctgat aagtttgaag tgcgtgtgaa aagtaaacaa tttaaagatg gggagtggct 1800
gtaccatata agtcctaaaa gtggcttcat tcctgtttcg ataggcggat ctaagaaccc 1860 
tttcattgaa aaagttatcg ctaacgtatt tagctacttt aaacctaaca tggacgacta 1920 
ctgcaataga aacttgttcg ttattgatat tttctctgat gatgcctacc attctcagga 1980
ggatagcgtt accgaacatc gtggcaacag aagattgagt tttcattcgc acagaattga 2040 
agaagttccc aaaacagggc tgggctcctc ggcaggttta gtcacagttt taactacagc 2100 
tttggcctcc ttttttgtat cggacctgga aaataatgta gacaaatata gagaagttat 2160 
tcataattta gcacaagttg ctcattgtca agctcagggt aaaattggaa gcgggtttga 2220 
tgtagcggcg gcagcatatg gatctatcag atatagaaga ttcccacccg cattaatctc 2280 
taatttgcca gatattggaa gtgctactta cggcagtaaa ctggcgcatt tggttgatga 2340 
agaagactgg aatattacga ttaaaagtaa ccatttacct tcgggattaa ctttatggat 2400 
gggcgatatt aagaatggtt cagaaacagt aaaactggtc cagaaggtaa aaaattggta 2460
tgattcgcat atgccagaaa gcttgaaaat atatacagaa ctcgatcatg caaattctag 2520 
atttatggat ggactatcta aactagatcg cttacacgag actcatgacg attacagcga 2580 
tcagatattt gagtctcttg agaggaatga ctgtacctgt caaaagtatc ctgaaatcac 2640 
agaagttaga gatgcagttg ccacaattag acgttccttt agaaaaataa ctaaagaatc 2700 
tggtgccgat atcgaacctc ccgtacaaac tagcttattg gatgattgcc agaccttaaa 2760 
aggagttctt acttgcttaa tacctggtgc tggtggttat gacgccattg cagtgattac 2820 
taagcaagat gttgatctta gggctcaaac cgctaatgac aaaagatttt ctaaggttca 2880 
atggctggat gtaactcagg ctgactgggg tgttaggaaa gaaaaagatc cggaaactta 2940 
tcttgataaa taggaggtaa tactcatgac cgtttacaca gcatccgtta ccgcacccgt 3000 
caacatcgca acccttaagt attgggggaa aagggacacg aagttgaatc tgcccaccaa 3060 
ttcgtccata tcagtgactt tatcgcaaga tgacctcaga acgttgacct ctgcggctac 3120 
tgcacctgag tttgaacgcg acactttgtg gttaaatgga gaaccacaca gcatcgacaa 3180
tgaaagaact caaaattgtc tgcgcgacct acgccaatta agaaaggaaa tggaatcgaa 3240 
ggacgcctca ttgcccacat tatctcaatg gaaactccac attgtctccg aaaataactt 3300 
tcctacagca gctggtttag cttcctccgc tgctggcttt gctgcattgg tctctgcaat 3360 
tgctaagtta taccaattac cacagtcaac ttcagaaata tctagaatag caagaaaggg 3420 
gtctggttca gcttgtagat cgttgtttgg cggatacgtg gcctgggaaa tgggaaaagc 3480 
tgaagatggt catgattcca tggcagtaca aatcgcagac agctctgact ggcctcagat 3540 
gaaagcttgt gtcctagttg tcagcgatat taaaaaggat gtgagttcca ctcagggtat 3600 
gcaattgacc gtggcaacct ccgaactatt taaagaaaga attgaacatg tcgtaccaaa 3660 
gagatttgaa gtcatgcgta aagccattgt tgaaaaagat ttcgccacct ttgcaaagga 3720 
aacaatgatg gattccaact ctttccatgc cacatgtttg gactctttcc ctccaatatt 3780
ctacatgaat gacacttcca agcgtatcat cagttggtgc cacaccatta atcagtttta 3840
cggagaaaca atcgttgcat acacgtttga tgcaggtcca aatgctgtgt tgtactactt 3900
agctgaaaat gagtcgaaac tctttgcatt tatctataaa ttgtttggct ctgttcctgg 3960
atgggacaag aaatttacta ctgagcagct tgaggctttc aaccatcaat ttgaatcatc 4020
taactttact gcacgtgaat tggatcttga gttgcaaaag gatgttgcca gagtgatttt 4080
aactcaagtc ggttcaggcc cacaagaaac aaacgaatct ttgattgacg caaagactgg 4140
tctaccaaag gaataactgc agcccggggg atccactagt tctagagcgg ccgccaccgc 4200
ggtggagctc caattcgccc tatagtgagt cgtattacgc gcgctcactg gccgtcgttt 4260
tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt gcagcacatc 4320
cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct tcccaacagt 4380
tgcgcagcct gaatggcgaa tggaaattgt aagcgttaat attttgttaa aattcgcgtt 4440
aaatttttgt taaatcagct cattttttaa ccaataggcc ga 4482



<210> 34 
<211> 549 
<212> DNA 

<213> Escherichia coli 



<400> 34 

atgcaaacgg aacacgtcat tttattgaat gcacagggag ttcccacggg tacgctggaa 60
aagtatgccg cacacacggc agacacccgc ttacatctcg cgttctccag ttggctgttt 120
aatgccaaag gacaattatt agttacccgc cgcgcactga gcaaaaaagc atggcctggc 180
gtgtggacta actcggtttg tgggcaccca caactgggag aaagcaacga agacgcagtg 240
atccgccgtt gccgttatga gcttggcgtg gaaattacgc ctcctgaatc tatctatcct 300
gactttcgct accgcgccac cgatccgagt ggcattgtgg aaaatgaagt gtgtccggta 360
tttgccgcac gcaccactag tgcgttacag atcaatgatg atgaagtgat ggattatcaa 420
tggtgtgatt tagcagatgt attacacggt attgatgcca cgccgtgggc gttcagtccg 480
tggatggtga tgcaggcgac aaatcgcgaa gccagaaaac gattatctgc atttacccag 540
cttaaataa 549



<210> 35 
<211> 900 
<212> DNA 

<213> Escherichia coli 



<400> 35 

atggactttc cgcagcaact cgaagcctgc gttaagcagg ccaaccaggc gctgagccgt 60
tttatcgccc cactgccctt tcagaacact cccgtggtcg aaaccatgca gtatggcgca 120
ttattaggtg gtaagcgcct gcgacctttc ctggtttatg ccaccggtca tatgttcggc 180
gttagcacaa acacgctgga cgcacccgct gccgccgttg agtgtatcca cgcttactca 240
ttaattcatg atgatttacc ggcaatggat gatgacgatc tgcgtcgcgg tttgccaacc 300
tgccatgtga agtttggcga agcaaacgcg attctcgctg gcgacgcttt acaaacgctg 360
gcgttctcga ttttaagcga tgccgatatg ccggaagtgt cggaccgcga cagaatttcg 420
atgatttctg aactggcgag cgccagtggt attgccggaa tgtgcggtgg tcaggcatta 480
gatttagacg cggaaggcaa acacgtacct ctggacgcgc ttgagcgtat tcatcgtcat 540
aaaaccggcg cattgattcg cgccgccgtt cgccttggtg cattaagcgc cggagataaa 600
ggacgtcgtg ctctgccggt actcgacaag tatgcagaga gcatcggcct tgccttccag 660
gttcaggatg acatcctgga tgtggtggga gatactgcaa cgttgggaaa acgccagggt 720
gccgaccagc aacttggtaa aagtacctac cctgcacttc tgggtcttga gcaagcccgg 780
aagaaagccc gggatctgat cgacgatgcc cgtcagtcgc tgaaacaact ggctgaacag 840
tcactcgata cctcggcact ggaagcgcta gcggactaca tcatccagcg taataaataa 900



<210> 36 
<211> 5051 
<212> DNA 

<213> Artificial Sequence



<220> 

<223> MBI operon 
<400> 36

gcgcaacgca attaatgtga gttagctcac tcattaggca ccccaggctt tacactttat 60
gcttccggct cgtatgttgt gtggaattgt gagcggataa caatttcaca caggaaacag 120
ctatgaccat gattacgcca agcgcgcaat taaccctcac taaagggaac aaaagctggg 180
taccgggccc cccctcgagg tcgacggtat cgataagctt gatatcgaat tcctgcagta 240
ggaggaatta accatgtcat taccgttctt aacttctgca ccgggaaagg ttattatttt 300
tggtgaacac tctgctgtgt acaacaagcc tgccgtcgct gctagtgtgt ctgcgttgag 360
aacctacctg ctaataagcg agtcatctgc accagatact attgaattgg acttcccgga 420
cattagcttt aatcataagt ggtccatcaa tgatttcaat gccatcaccg aggatcaagt 480
aaactcccaa aaattggcca aggctcaaca agccaccgat ggcttgtctc aggaactcgt 540
tagtcttttg gatccgttgt tagctcaact atccgaatcc ttccactacc atgcagcgtt 600
ttgtttcctg tatatgtttg tttgcctatg cccccatgcc aagaatatta agttttcttt 660
aaagtctact ttacccatcg gtgctgggtt gggctcaagc gcctctattt ctgtatcact 720
ggccttagct atggcctact tgggggggtt aataggatct aatgacttgg aaaagctgtc 780
agaaaacgat aagcatatag tgaatcaatg ggccttcata ggtgaaaagt gtattcacgg 840
taccccttca ggaatagata acgctgtggc cacttatggt aatgccctgc tatttgaaaa 900
agactcacat aatggaacaa taaacacaaa caattttaag ttcttagatg atttcccagc 960
cattccaatg atcctaacct atactagaat tccaaggtct acaaaagatc ttgttgctcg 1020
cgttcgtgtg ttggtcaccg agaaatttcc tgaagttatg aagccaattc tagatgccat 1080
gggtgaatgt gccctacaag gcttagagat catgactaag ttaagtaaat gtaaaggcac 1140
cgatgacgag gctgtagaaa ctaataatga actgtatgaa caactattgg aattgataag 1200
aataaatcat ggactgcttg tctcaatcgg tgtttctcat cctggattag aacttattaa 1260
aaatctgagc gatgatttga gaattggctc cacaaaactt accggtgctg gtggcggcgg 1320
ttgctctttg actttgttac gaagagacat tactcaagag caaattgaca gcttcaaaaa 1380
gaaattgcaa gatgatttta gttacgagac atttgaaaca gacttgggtg ggactggctg 1440
ctgtttgtta agcgcaaaaa atttgaataa agatcttaaa atcaaatccc tagtattcca 1500
attatttgaa aataaaacta ccacaaagca acaaattgac gatctattat tgccaggaaa 1560
cacgaattta ccatggactt cataggaggc agatcaaatg tcagagttga gagccttcag 1620
tgccccaggg aaagcgttac tagctggtgg atatttagtt ttagatacaa aatatgaagc 1680
atttgtagtc ggattatcgg caagaatgca tgctgtagcc catccttacg gttcattgca 1740
agggtctgat aagtttgaag tgcgtgtgaa aagtaaacaa tttaaagatg gggagtggct 1800
gtaccatata agtcctaaaa gtggcttcat tcctgtttcg ataggcggat ctaagaaccc 1860
tttcattgaa aaagttatcg ctaacgtatt tagctacttt aaacctaaca tggacgacta 1920
ctgcaataga aacttgttcg ttattgatat tttctctgat gatgcctacc attctcagga 1980
ggatagcgtt accgaacatc gtggcaacag aagattgagt tttcattcgc acagaattga 2040
agaagttccc aaaacagggc tgggctcctc ggcaggttta gtcacagttt taactacagc 2100
tttggcctcc ttttttgtat cggacctgga aaataatgta gacaaatata gagaagttat 2160
tcataattta gcacaagttg ctcattgtca agctcagggt aaaattggaa gcgggtttga 2220
tgtagcggcg gcagcatatg gatctatcag atatagaaga ttcccacccg cattaatctc 2280
taatttgcca gatattggaa gtgctactta cggcagtaaa ctggcgcatt tggttgatga 2340
agaagactgg aatattacga ttaaaagtaa ccatttacct tcgggattaa ctttatggat 2400
gggcgatatt aagaatggtt cagaaacagt aaaactggtc cagaaggtaa aaaattggta 2460
tgattcgcat atgccagaaa gcttgaaaat atatacagaa ctcgatcatg caaattctag 2520
atttatggat ggactatcta aactagatcg cttacacgag actcatgacg attacagcga 2580
tcagatattt gagtctcttg agaggaatga ctgtacctgt caaaagtatc ctgaaatcac 2640
agaagttaga gatgcagttg ccacaattag acgttccttt agaaaaataa ctaaagaatc 2700
tggtgccgat atcgaacctc ccgtacaaac tagcttattg gatgattgcc agaccttaaa 2760
aggagttctt acttgcttaa tacctggtgc tggtggttat gacgccattg cagtgattac 2820
taagcaagat gttgatctta gggctcaaac cgctaatgac aaaagatttt ctaaggttca 2880
atggctggat gtaactcagg ctgactgggg tgttaggaaa gaaaaagatc cggaaactta 2940
tcttgataaa taggaggtaa tactcatgac cgtttacaca gcatccgtta ccgcacccgt 3000
caacatcgca acccttaagt attgggggaa aagggacacg aagttgaatc tgcccaccaa 3060
ttcgtccata tcagtgactt tatcgcaaga tgacctcaga acgttgacct ctgcggctac 3120
tgcacctgag tttgaacgcg acactttgtg gttaaatgga gaaccacaca gcatcgacaa 3180
tgaaagaact caaaattgtc tgcgcgacct acgccaatta agaaaggaaa tggaatcgaa 3240
ggacgcctca ttgcccacat tatctcaatg gaaactccac attgtctccg aaaataactt 3300
tcctacagca gctggtttag cttcctccgc tgctggcttt gctgcattgg tctctgcaat 3360
tgctaagtta taccaattac cacagtcaac ttcagaaata tctagaatag caagaaaggg 3420
gtctggttca gcttgtagat cgttgtttgg cggatacgtg gcctgggaaa tgggaaaagc 3480
tgaagatggt catgattcca tggcagtaca aatcgcagac agctctgact ggcctcagat 3540
gaaagcttgt gtcctagttg tcagcgatat taaaaaggat gtgagttcca ctcagggtat 3600
gcaattgacc gtggcaacct ccgaactatt taaagaaaga attgaacatg tcgtaccaaa 3660
gagatttgaa gtcatgcgta aagccattgt tgaaaaagat ttcgccacct ttgcaaagga 3720
aacaatgatg gattccaact ctttccatgc cacatgtttg gactctttcc ctccaatatt 3780
ctacatgaat gacacttcca agcgtatcat cagttggtgc cacaccatta atcagtttta 3840
cggagaaaca atcgttgcat acacgtttga tgcaggtcca aatgctgtgt tgtactactt 3900
agctgaaaat gagtcgaaac tctttgcatt tatctataaa ttgtttggct ctgttcctgg 3960
atgggacaag aaatttacta ctgagcagct tgaggctttc aaccatcaat ttgaatcatc 4020
taactttact gcacgtgaat tggatcttga gttgcaaaag gatgttgcca gagtgatttt 4080
aactcaagtc ggttcaggcc cacaagaaac aaacgaatct ttgattgacg caaagactgg 4140
tctaccaaag gaataactgc agcccgggag gaggattact atatgcaaac ggaacacgtc 4200
attttattga atgcacaggg agttcccacg ggtacgctgg aaaagtatgc cgcacacacg 4260
gcagacaccc gcttacatct cgcgttctcc agttggctgt ttaatgccaa aggacaatta 4320
ttagttaccc gccgcgcact gagcaaaaaa gcatggcctg gcgtgtggac taactcggtt 4380
tgtgggcacc cacaactggg agaaagcaac gaagacgcag tgatccgccg ttgccgttat 4440
gagcttggcg tggaaattac gcctcctgaa tctatctatc ctgactttcg ctaccgcgcc 4500
accgatccga gtggcattgt ggaaaatgaa gtgtgtccgg tatttgccgc acgcaccact 4560
agtgcgttac agatcaatga tgatgaagtg atggattatc aatggtgtga tttagcagat 4620
gtattacacg gtattgatgc cacgccgtgg gcgttcagtc cgtggatggt gatgcaggcg 4680
acaaatcgcg aagccagaaa acgattatct gcatttaccc agcttaaata acccggggga 4740
tccactagtt ctagagcggc cgccaccgcg gtggagctcc aattcgccct atagtgagtc 4800
gtattacgcg cgctcactgg ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt 4860
tacccaactt aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga 4920
ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggaaattgta 4980
agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc attttttaac 5040
caataggccg a 5051



<210> 37 
<211> 5963 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> MBIS operon 



<400> 37 

gcgcaacgca attaatgtga gttagctcac tcattaggca ccccaggctt tacactttat 60
gcttccggct cgtatgttgt gtggaattgt gagcggataa caatttcaca caggaaacag 120
ctatgaccat gattacgcca agcgcgcaat taaccctcac taaagggaac aaaagctggg 180
taccgggccc cccctcgagg tcgacggtat cgataagctt gatatcgaat tcctgcagta 240
ggaggaatta accatgtcat taccgttctt aacttctgca ccgggaaagg ttattatttt 300
tggtgaacac tctgctgtgt acaacaagcc tgccgtcgct gctagtgtgt ctgcgttgag 360
aacctacctg ctaataagcg agtcatctgc accagatact attgaattgg acttcccgga 420
cattagcttt aatcataagt ggtccatcaa tgatttcaat gccatcaccg aggatcaagt 480
aaactcccaa aaattggcca aggctcaaca agccaccgat ggcttgtctc aggaactcgt 540
tagtcttttg gatccgttgt tagctcaact atccgaatcc ttccactacc atgcagcgtt 600
ttgtttcctg tatatgtttg tttgcctatg cccccatgcc aagaatatta agttttcttt 660
aaagtctact ttacccatcg gtgctgggtt gggctcaagc gcctctattt ctgtatcact 720
ggccttagct atggcctact tgggggggtt aataggatct aatgacttgg aaaagctgtc 780
agaaaacgat aagcatatag tgaatcaatg ggccttcata ggtgaaaagt gtattcacgg 840
taccccttca ggaatagata acgctgtggc cacttatggt aatgccctgc tatttgaaaa 900
agactcacat aatggaacaa taaacacaaa caattttaag ttcttagatg atttcccagc 960
cattccaatg atcctaacct atactagaat tccaaggtct acaaaagatc ttgttgctcg 1020
cgttcgtgtg ttggtcaccg agaaatttcc tgaagttatg aagccaattc tagatgccat 1080
gggtgaatgt gccctacaag gcttagagat catgactaag ttaagtaaat gtaaaggcac 1140
cgatgacgag gctgtagaaa ctaataatga actgtatgaa caactattgg aattgataag 1200
aataaatcat ggactgcttg tctcaatcgg tgtttctcat cctggattag aacttattaa 1260
aaatctgagc gatgatttga gaattggctc cacaaaactt accggtgctg gtggcggcgg 1320
ttgctctttg actttgttac gaagagacat tactcaagag caaattgaca gcttcaaaaa 1380
gaaattgcaa gatgatttta gttacgagac atttgaaaca gacttgggtg ggactggctg 1440
ctgtttgtta agcgcaaaaa atttgaataa agatcttaaa atcaaatccc tagtattcca 1500
attatttgaa aataaaacta ccacaaagca acaaattgac gatctattat tgccaggaaa 1560
cacgaattta ccatggactt cataggaggc agatcaaatg tcagagttga gagccttcag 1620
tgccccaggg aaagcgttac tagctggtgg atatttagtt ttagatacaa aatatgaagc 1680
atttgtagtc ggattatcgg caagaatgca tgctgtagcc catccttacg gttcattgca 1740
agggtctgat aagtttgaag tgcgtgtgaa aagtaaacaa tttaaagatg gggagtggct 1800
gtaccatata agtcctaaaa gtggcttcat tcctgtttcg ataggcggat ctaagaaccc 1860
tttcattgaa aaagttatcg ctaacgtatt tagctacttt aaacctaaca tggacgacta 1920
ctgcaataga aacttgttcg ttattgatat tttctctgat gatgcctacc attctcagga 1980
ggatagcgtt accgaacatc gtggcaacag aagattgagt tttcattcgc acagaattga 2040
agaagttccc aaaacagggc tgggctcctc ggcaggttta gtcacagttt taactacagc 2100
tttggcctcc ttttttgtat cggacctgga aaataatgta gacaaatata gagaagttat 2160
tcataattta gcacaagttg ctcattgtca agctcagggt aaaattggaa gcgggtttga 2220
tgtagcggcg gcagcatatg gatctatcag atatagaaga ttcccacccg cattaatctc 2280
taatttgcca gatattggaa gtgctactta cggcagtaaa ctggcgcatt tggttgatga 2340
agaagactgg aatattacga ttaaaagtaa ccatttacct tcgggattaa ctttatggat 2400
gggcgatatt aagaatggtt cagaaacagt aaaactggtc cagaaggtaa aaaattggta 2460
tgattcgcat atgccagaaa gcttgaaaat atatacagaa ctcgatcatg caaattctag 2520
atttatggat ggactatcta aactagatcg cttacacgag actcatgacg attacagcga 2580
tcagatattt gagtctcttg agaggaatga ctgtacctgt caaaagtatc ctgaaatcac 2640
agaagttaga gatgcagttg ccacaattag acgttccttt agaaaaataa ctaaagaatc 2700
tggtgccgat atcgaacctc ccgtacaaac tagcttattg gatgattgcc agaccttaaa 2760
aggagttctt acttgcttaa tacctggtgc tggtggttat gacgccattg cagtgattac 2820
taagcaagat gttgatctta gggctcaaac cgctaatgac aaaagatttt ctaaggttca 2880
atggctggat gtaactcagg ctgactgggg tgttaggaaa gaaaaagatc cggaaactta 2940
tcttgataaa taggaggtaa tactcatgac cgtttacaca gcatccgtta ccgcacccgt 3000
caacatcgca acccttaagt attgggggaa aagggacacg aagttgaatc tgcccaccaa 3060
ttcgtccata tcagtgactt tatcgcaaga tgacctcaga acgttgacct ctgcggctac 3120
tgcacctgag tttgaacgcg acactttgtg gttaaatgga gaaccacaca gcatcgacaa 3180
tgaaagaact caaaattgtc tgcgcgacct acgccaatta agaaaggaaa tggaatcgaa 3240
ggacgcctca ttgcccacat tatctcaatg gaaactccac attgtctccg aaaataactt 3300
tcctacagca gctggtttag cttcctccgc tgctggcttt gctgcattgg tctctgcaat 3360
tgctaagtta taccaattac cacagtcaac ttcagaaata tctagaatag caagaaaggg 3420
gtctggttca gcttgtagat cgttgtttgg cggatacgtg gcctgggaaa tgggaaaagc 3480
tgaagatggt catgattcca tggcagtaca aatcgcagac agctctgact ggcctcagat 3540
gaaagcttgt gtcctagttg tcagcgatat taaaaaggat gtgagttcca ctcagggtat 3600
gcaattgacc gtggcaacct ccgaactatt taaagaaaga attgaacatg tcgtaccaaa 3660
gagatttgaa gtcatgcgta aagccattgt tgaaaaagat ttcgccacct ttgcaaagga 3720
aacaatgatg gattccaact ctttccatgc cacatgtttg gactctttcc ctccaatatt 3780
ctacatgaat gacacttcca agcgtatcat cagttggtgc cacaccatta atcagtttta 3840
cggagaaaca atcgttgcat acacgtttga tgcaggtcca aatgctgtgt tgtactactt 3900
agctgaaaat gagtcgaaac tctttgcatt tatctataaa ttgtttggct ctgttcctgg 3960
atgggacaag aaatttacta ctgagcagct tgaggctttc aaccatcaat ttgaatcatc 4020
taactttact gcacgtgaat tggatcttga gttgcaaaag gatgttgcca gagtgatttt 4080
aactcaagtc ggttcaggcc cacaagaaac aaacgaatct ttgattgacg caaagactgg 4140
tctaccaaag gaataactgc agcccgggag gaggattact atatgcaaac ggaacacgtc 4200
attttattga atgcacaggg agttcccacg ggtacgctgg aaaagtatgc cgcacacacg 4260
gcagacaccc gcttacatct cgcgttctcc agttggctgt ttaatgccaa aggacaatta 4320
ttagttaccc gccgcgcact gagcaaaaaa gcatggcctg gcgtgtggac taactcggtt 4380
tgtgggcacc cacaactggg agaaagcaac gaagacgcag tgatccgccg ttgccgttat 4440
gagcttggcg tggaaattac gcctcctgaa tctatctatc ctgactttcg ctaccgcgcc 4500
accgatccga gtggcattgt ggaaaatgaa gtgtgtccgg tatttgccgc acgcaccact 4560
agtgcgttac agatcaatga tgatgaagtg atggattatc aatggtgtga tttagcagat 4620
gtattacacg gtattgatgc cacgccgtgg gcgttcagtc cgtggatggt gatgcaggcg 4680
acaaatcgcg aagccagaaa acgattatct gcatttaccc agcttaaata acccggggga 4740
tccactagtt ctagagcggc cgccaccgcg gaggaggaat gagtaatgga ctttccgcag 4800
caactcgaag cctgcgttaa gcaggccaac caggcgctga gccgttttat cgccccactg 4860
ccctttcaga acactcccgt ggtcgaaacc atgcagtatg gcgcattatt aggtggtaag 4920
cgcctgcgac ctttcctggt ttatgccacc ggtcatatgt tcggcgttag cacaaacacg 4980
ctggacgcac ccgctgccgc cgttgagtgt atccacgctt actcattaat tcatgatgat 5040
ttaccggcaa tggatgatga cgatctgcgt cgcggtttgc caacctgcca tgtgaagttt 5100
ggcgaagcaa acgcgattct cgctggcgac gctttacaaa cgctggcgtt ctcgatttta 5160
agcgatgccg atatgccgga agtgtcggac cgcgacagaa tttcgatgat ttctgaactg 5220
gcgagcgcca gtggtattgc cggaatgtgc ggtggtcagg cattagattt agacgcggaa 5280
ggcaaacacg tacctctgga cgcgcttgag cgtattcatc gtcataaaac cggcgcattg 5340
attcgcgccg ccgttcgcct tggtgcatta agcgccggag ataaaggacg tcgtgctctg 5400
ccggtactcg acaagtatgc agagagcatc ggccttgcct tccaggttca ggatgacatc 5460
ctggatgtgg tgggagatac tgcaacgttg ggaaaacgcc agggtgccga ccagcaactt 5520
ggtaaaagta cctaccctgc acttctgggt cttgagcaag cccggaagaa agcccgggat 5580
ctgatcgacg atgcccgtca gtcgctgaaa caactggctg aacagtcact cgatacctcg 5640
gcactggaag cgctagcgga ctacatcatc cagcgtaata aataagagct ccaattcgcc 5700
ctatagtgag tcgtattacg cgcgctcact ggccgtcgtt ttacaacgtc gtgactggga 5760
aaaccctggc gttacccaac ttaatcgcct tgcagcacat ccccctttcg ccagctggcg 5820
taatagcgaa gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga 5880
atggaaattg taagcgttaa tattttgtta aaattcgcgt taaatttttg ttaaatcagc 5940
tcatttttta accaataggc cga 5963